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\  Summary 

—  The  generalized  Median  of  H.  Oja  yields  a  notion  of  bivariate 
quantile  and  in  turn,  an  affine  Invariant  bivariate  analogue  of  the 
sign  test.-  Its  properties  include  a  simple  null  covariance  formula, 
facilitating  a  permutation  or  sign  change  test  in  the  case  of 
bivariate  symmetry,  normal  efficiency  coinciding  with  that  of  the  Oja 
median,  and  bounded  Influence,  hence  strong  robustness. 


Key  words:  affine  invariance,  bivariate  quantile,  bivariate  symmetry, 
model,  generalized  median,  influence  function,  permutation  test, 
normal  efficiency,  robustness,  spatial  median. 
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1.  INTRODUCTION 

The  task  of  finding  affine-invariant  bivariate  procedures  which 
are  analogues  of  univariate  rank  Methods  is  not  straightforward;  see 
Barnett  (1976)  and  the  accompanying  discussion  for  some  of  the 
difficulties.  Recently,  however  Brown  and  Hettaansperger  (1967) 
derived  affine  invariant  bivariate  analogues  of  both  Vi  lcoxon  rank-sum 
and  signed  rank  tests,  in  bivariate  two-sample  and  one-sample 
symmetric  problems.  This  is  done  by  combining  the  bivariate  median 
of  Oja  (1963)  with  a  linear-model  approach  of  Jaeckel  (1972).  The 
Oja  median  minimizes  an  objective  function  which  is  the  sum  of  areas 
of  certain  triangles  and  the  gradients  of  this  function  yield  a  notion 
of  bivariate  quantile.  The  proposed  tests  are  genuine  Vi lcoxon 
analogues  through  involving  bivariate  "quantile"  rather  than 
univariate  rank.  For  tests  in  one  and  two-way  layouts  analogous  to 
the  Kruskal-Wallis  and  Friedman  tests  see  Brown  and  Hettaansperger 
(1966). 

In  addition  permutation  tests  are  available  through  conventional 
arguments  of  sign  change  in  the  one-sample  and  permutation  in  the 
two-sample  problem.  Null  covariance  matrices  have  simple  and  easily 
computable  forms,  yielding  convenient  large  sample  normal  or 
chi-squared  approximations. 

The  gradient  of  the  Oja  objective  function  is  just  the  bivariate 
quantile  of  the  tested  parameter  vector,  and  when  used  as  a  test 
statistic,  hereafter  called  the  Oja  sign  test  (06  test),  should 
constitute  a  univariate  sign  test  analogue.  However,  a  sign  change 
argument  cannot  be  applied  directly  to  the  06  test  to  find  its 
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conditional  covariance  Matrix.  Hence,  at  first  sight,  it  appears 
that  no  convenient  large  sample  test  is  available. 

The  present  paper  focuses  on  the  one-sasple  case  of  bivariate 
■yams  try,  and  does  two  things. 

(1)  It  is  shown  that  the  06  test  can  be  re  written  in  a  fora 

which  does  show  it  to  be  a  sign-test  analogue.  At  the  saae 
tine,  its  null  covariance  Matrix  reduces  to  a  staple  and 
easily  computable  fora,  and  large- saaple  approximations  are 
available. 

(li)  The  test  efficiency  for  normal  sampling  is  derived,  and 

shown  to  coincide  with  the  normal  estimation-efficiency  of 
the  Oja  median  (Oja  and  Niinimaa,  1965).  It  is  also 
possible  to  calculate  a  bivariate  version  of  the  influence 
function,  and  the  resulting  form  is  bounded,  as  is  the  case 
for  the  univariate  sign  test,  and  shows  the  OS  test  to  be  of 
high  robustness. 

Sections  2  and  3.4  contain  (i)  and  (li)  respectively. 

2.  FORMULATION 

Some  material  from  Brown  and  Hettmansperger  (1967)  is  now 
stnmarlzed  briefly.  Let  x^.....xn,8  be  2x1  vectors  with  {x^} 
independent  and  with  distribution  symmetric  about  6q.  The  Oja 
objective  function  is 

T(0)  A(Xi.Xj.e) 

i<j 

where  A(a.b.c)  is  the  area  of  a  triangle  whose  vertices  are  a.b.c. 

The  Oja  generalized  median  0  is  the  choice  of  0  to  minimize  T.  The 
quantile  of  0  is  the  vector  whose  components  are  derivatives  of  T  with 
respect  to  components  of  0;  it  is 
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<H»)  -  5  J  J  ®)  (i) 

KJ 

Atra  the  "repulsion  vector"  u(Xj  ,x  f ;  0)  has  magnitude  |x1~x,.  |  and 
direction  perpendicular  to  and  away  froa  the  chord  (Xj ,x^ ^  towards  0. 

A  bivariate  sign  test  analogue  uses  Q(0q)  to  test  H^:  0  «  0q:  in  the 
univariate  case  the  sign  test  statistic  is  exactly  the  centered 
quantile.  The  null  hypothesis  is  rejected  when  Q(0q)  is  far  froa  the 
zero  vector,  as  aeasured  by  a  quadratic  fora  in  Q(0q)  . 

In  what  follows,  take  0q  *  0  without  loss  of  generality  and  let 
Ujj  *  u(Xj.Xj:  0).  Vjj  *  utXj.-Xj:  0).  The  quantile  Q  as  expressed 
in  (1)  is  not  an  obvious  slgn~test  analogue. 

But  write  u^  -  |{(utJ  ♦  Vjj)  ♦  (u^  -  v^)}.  Some  simple 
geometry  shows  that 

Uu  ♦  vu  ■  “<w  -*i> 

"lj  '  Y1J  -  <■(*,  -*,!  -*j)- 
Substituting  in  (1)  gives 

Q(°)  *  5  J  J  U(XJ,_Xj;  "*1)  (2) 

ii*J 


J 

(J*i) 

An  alternative  expression  for  Q  coaes  also  froa  (2).  Note  that 
for  fixed  Xj.  all  u(Xj,-Xj;  -x^)  are  perpendicular  to  (-Xj.Xj)  with 
aagnltude  2 |x^ | .  and  direction  determined  by  which  side  of  the  chord 
(-Xj.Xj)  the  point  -Xj  falls.  Thus,  let  the  extended  chord  (-Xj.Xj) 


divide  the  plene  into  half -planes  P+ , P_ ;  let  a^  be  the  vector 
rotated  counter-clockwise  through  |v.  so  that  u(x^ ,-x^ ;  -Xj)  »  ±  2a ^ 
for  all  i,  and  let  P+  be  the  half  plane  into  which  a^  points.  Let 
r^.  Sj  be  the  nunbers  of  {-Xj.  i  ji  J}  €  P+,P_  respectively.  Then 
suasiing  first  over  1  in  (2)  yields 

Q(°)  -  \  l  •  C) 


*fa*r*  Bj  *  rj  -  *J ' 

The  representation  (4)  nay  provide  the  best  way  to  calculate  Q. 
but  the  easiest  derivation  and  coaputatlon  of  the  covariance  aatrix  of 
Q  corns  free  (3).  The  analogue  to  the  sign  test  is  also  best  seen 
fron  (3).  since  each  in  sane  sense  Measures  the  position  of  x^ 
relative  to  the  rest  of  the  syanetrized  saaple. 

Under  the  assumption  of  bivariate  ■yentry  {Xj}  is  a  realization 
of  {SfXj}  where  {»^}  are  Independent  randon  variables  each  equalling 
1  1  with  probabilities  Clearly  Qj  depends  on  all  {Sj}  only 

through  Sj.  Therefore,  conditional  on  the  collection  {±  x  } . 

Qj»  QjfXj)  and  Q^C*^)  *  *iQ1(x1).  and  (3)  is  a  sun  of  independent 
randosi  variables  with  vector  coefficients.  Thus  E(Q)  *  0  and  the 
null  covariance  Matrix  of  Q(0)  is 


C  c 


i 


(5) 


A  permutation  or  "sign-change"  test  against  a  general  alternative 
nay  be  carried  out  as  follows.  Let 


Q 


s 


Generate  all  2°  possible  values  for  {s . }  and  hence  for  Q  ;  refer  the 

X  s 

observed  value  of  Q^C  *Q  to  the  population  of  2n  values  of  Q^C  and 
calculate  a  signif ic lance  level  accordingly. 
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If  n  is  large,  this  exact  test  can  be  Modified  to  a  Monte  Qsrlo 

test  ehere  the  (s^}  are  generated  at  randoa.  Alternatively,  a  large 

saaple  approximation  is  available.  Conditional  on  (Qj),  each  Q#  is 

the  sum  of  independent  randoa  vectors  and  appraxlaately  normal  for 

large  n.  with  covariance  matrix  C.  Thus  the  approximate  null 

T  —1 

distribution  to  which  Q  C  Q  should  be  referred  is  xf. 


3.  EFFICIENCY 

Tests  of  Hq:  0  *  0Q  are  based  on  Q(0Q)  *  g  J  1<j  u(Xj.Xj;  0Q). 

Again  without  loss  of  generality  now  take  0q  ■  0.  The  null 

i  r  t 

covariance  matrix  of  Q(0)  is  C  ■  jg  )  :  *oo  (5).  The  actual 

test  statistic  is  QT(0)C“1Q(0) .  Let  B  «E(C);  the  U-statlstlc-like 

structure  of  C  shows  that  n  (C-B)  converges  almost  surely  to  zero  as 

n  «*  •,  as  long  as  (Xj)  are  drawn  from  an  lntegrable  bivariate 

T  -1 

distribution.  The  asymptotic  behaviours  of  Q  (0)C  Q(0)  and 
T  -1 

Q  (0)B  Q(0)  therefore  coincide,  and  in  assessing  efficiency  via  a 

-1/2 

sequence  of  alternatives  within  0(n  )  of  the  null,  it  is  easy  to 

T  —1 

show  that  the  asymptotic  distribution  of  Q  B  Q  is  noncentral  xi  with 
noncentrality  parameter 

0TDTB_1D0. 

-1/2 

where  0  =  n  P,  a  =  Eq{Q(0)}  and  D  is  the  matrix  of  derivatives  of  a 

with  respect  to  components  of  0.  Thus  (see  Blckel.  1965)  the  Pitman 

efficacy  of  the  06  test  appears  to  depend  on  the  direction  of  the 

T  -1 

alternative,  p.  However,  it  will  turn  out  that  D  B  D  is 
proportional  to  an  orthogonal  matrix  (see  Propositions  1  and  2),  so 
the  noncentrality  parameter  in  fact  does  not  depend  on  the  direction 
of  P.  Taking  0  to  be  a  unit  vector,  a  large  saaple  efficiency  factor 


for  tho  test  Is  Just 
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e  »  0TDTB "1HP.  (6) 

T  -1 

which  will  equal  the  cession  eigenvalue  of  D  B  D. 

Now  consider  noraal  efficiency  of  the  OS  test  relative  to  the 
least  squares  t-test.  Both  OS  and  t-  test  are  affine  invariant,  so 
{Xj}  any  be  assuaed  independent  NfO,^).  the  bivariate  circular  normal 
distribution. 

It  is  easy  to  verify  that  for  least  squares  D  «  I^.  B  =  n 
and  hence  that  e^g  *  n. 

For  the  OS  test,  the  calculation  of  e^g  is  broken  into  two  parts, 
first  the  calculation  of  a.D  and  second  the  calculation  of  B. 

PROPOSITION  1.  D  »  i2. 

Proof.  In  this  proof  the  direction  of  a  line  is  taken  to  be  an  angle 
y,  or  t+v;  that  is.  it  is  immaterial  whether  forwards  or  backwards 
orientation  of  the  line  is  used. 

First  note  that  Xj  -  and  ^(x^+Xj )  are  independent,  and  hence 
that  conditioning  on  the  direction  a  i  gw  of  -  x^  does  not 
influence  g(Xj+Xj).  The  repulsion  vector  u(Xj,Xj:  8)  has  direction 
a  or  o  +  v.  Since  (Xj.Xj)  and  its  negative  are  equi-probable,  the 
contribution  to  E{u(Xj,Xj;  6)}  from  all  (Xj.x^)  is  zero  apart  from 
those  (Xj.Xj)  for  which  u(Xj,Xj;  0)  *  u(-Xj,-Xj;  0).  Projecting  onto 
the  direction  a  of  the  vector  u,  this  condition  means  that  the 
projection  of  0  cannot  lie  between  those  of  the  mid-points  |(xi'fxj)> 

-  §(xi,fxj)  chords  joining  (Xj.Xj)  and  (-Xj.-Xj).  That  is. 
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0  <  -  0>T(““]-<-  $<*,«,)  -  e)T(““] 


so  the  only  Xj.Xj  asking  non-zero  contribution  to  E{u(xi.Xj:  0)} 
satisfy 

<  Ibt(““]i  m 

Recall  that  conditional  on  the  direction  a,  l.e.  on 

*  0. 

gCXj+Xj)  sx  N(0.|l2)  so  for  saall  |e | .  the  probability  of  (7)  is 

2r~l/2|8T(”“]  |  ♦  o(|8|)  (8) 

The  vector  uJXj.Xj;  0)  has  magnitude  hj-Xj |  and  is  in  the 
T 

direction  (cosa.  sina)  with  sign  as  yet  unspecified.  To  get  the 
sign,  project  both  0  and  the  chord  mid-point  ^Cx^Xj)  on  to  the 
direction  a,  and  note  that  u  points  away  from  the  chord  towards  0.  by 
definition,  so  that 

“'W  e>  ■  I v*j  I  (“»]  S*"C(0  -  5<V*j»Tfe]]- 

However,  for  (Xj.Xj)  obeying  (7),  the  sign  factor  =  sgn[(cosa, sina)0] . 
Since  E{|xj-Xj|}  =  2w  combining  with  (8)  gives  the  corresponding 
contribution  to  E{Q(0)}  of 


Li  0T  fco«x'| 
2w°  [since  J 


Ed  *  Kltl). 


2  f  co s*a  cosa  sina)  a  n,ial\ 

3  t  [cosa  sina  sin’a  J  ®  +  o(|0|). 

Nov  a  is  a  uni  fora  angle,  with  E(cos2a)  =  |  =  E(sin2a) , 
E(cosa  sina)  *  0,  so  finally 

a(0)  -f^^^le  +  odel).  and 


D  = 


n(n-l)  . 
2r  l2 


(9) 


PROPOSITION  2. 


a  =  ~  l2  ♦  o(n3). 


Proof.  Referring  to  (5).  a  typical  tern  of  C  is  proportional  to 

T  T 

U*XJ,_Xj;  ~xi^u  (xk*”Xk:  "xi^  +  u(xk,-xk:  _xi^u  (xj-~xj:  "xi^- 
T  11 

Let  Xj  =  r t {cos (aj+gr),  sin(at+j£v)} .  By  the  independence  of 
{r^},  {a^},  the  expectation  of  such  a  typical  tera,  given  Xj.x^,  is 


4rjrk 


?0j°k  ''jWj1 

23  i\ 


<l-;lvkl) 


(10) 


where  (c^s^  =  (cosa^,  sina^),  and  where  in  the  last  term,  the  factor 
is  the  shortest  absolute  rotation  between  and  a^.  and  thus 
€  [O.v].  This  factor  arises  from  considering  the  possible  positions 
of  x^  in  the  four  sectors  defined  by  chords  (x^.-x 
the  corresponding  signs  of  repulsion  vectors  u. 


j)  and  (xk.-xk),  and 
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But  Cfr  .  jfco.^^)  ♦  coufa^)}.  Cj^  ♦  Vj  -  slnCa^). 
and  SjS^  =  2(008(0^-0^)  -  008(0^+0^)}.  Now  Oj+Oj  and  Oj-a^  are 
Independent  and  each  is  distributed  as  U(-v,t).  with  cos  and  sin  of 
0^+0^  having  mean  zero,  so  averaging  over  Oj.a^  reduces  (10)  to 


4  rjrk E  ccoa(aj*Uj) [0  ?]  <» - 

4  fl  0l 
*  4  rJrk*va  [0  lj ‘ 

1/0 

Since  E(**j)  =  E^^)  “  (2/*)  •  the  final  result  is: 


Applying  the  formula  (6)  with  (9)  and  (11)  gives 


e0S  =  f  +  °Cn>- 

and  since  e^g  =  n,  the  required  efficiency  is 

efficiency  (0S:LS)  =  \  =  -785. 

This  efficiency  is  greater  than  2/r  =  .637,  the  normal  efficiency 
of  the  univariate  sign  test,  and  the  result  agrees  with  the  estimation 
efficiency  of  the  Oja  median  for  bivariate  normal  data;  see  Oja  and 
Niininaa  (1985) 


-li¬ 


lt  is  also  possible  to  calculate  the  normal  efficiency  of  the  06 
test  relative  to  the  componentwise  sign  test,  a  non-afflne  invariant 
test  whose  components  are  sign  test  statistics  in  the  two  co-ordinate 
directions.  Consider  bivariate  normal  data  with  covariance  matrix 

Then  if  X  is  N(0.1a).  Z  >  AX  Is  N(0,A2).  But  by  affine  invariance. 
®0g(A)  *  eog(I2^* 

Letting  S  be  the  vector  of  sign  test  statistics  based  on 
components  of  Z, 

. .  Ee{S) .  „  ( [:»<£>  ] 


*  -n  (2/v)1/2  6  +  o(|0|). 


>1/2 


so  that  D  *  -n  (2/w)  I0.  Also,  at  0  *  0 


cov(S)  .  n  f  .  1  2*''«.ln(p)  1  _  „ 

^  2t  arsin  (p)  1 


T  —1 

The  resulting  expression  for  D  B  D  has  unequal  eigenvalues 

2w  *n  (1  ±  2n  *arsln  p)_1  and  the  efficiency  factor  eg  for  the 

componentwise  sign  test  lies  between  these  two  values.  The  range  of 

the  resulting  relative  efficiencies  is 

t2  -1 

efficiency  (0S:S)  =  eQg/eg  =  g  (1  ±  2v  arsin  (p) ) 
with  the  actual  efficiency  depending  on  the  direction  of  the 
alternative,  as  might  be  expected  from  the  non-afflne  invariant  nature 
of  the  S  test.  The  median  of  these  efficiencies  is  v3/8  =  1.234, 
indicating  that  the  06  test  is  generally  more  efficient  than  the 
componentwise  sign  test. 
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8.  ROBUSTNESS 

It  la  convenient  to  describe  robustness  of  the  OS  test  in  terns 
of  a  bivariate  analogue  of  Hampel's  (1974)  influence  function.  The 
latter,  though  usually  defined  as  a  von-Mlses  derivative  of  certain 
functionals,  can  also  be  specified  for  tests,  in  the  context  of 
asymptotic  theory,  in  the  following  way,  covering  both  bivariate  and 
univariate  cases. 

Let  T  be  a  noraalized  test  statistic  based  on  a  large  number  n  of 

observations,  whose  null  distribution  is  standard  normal.  Suppose 

samplii^s  is  from  a  distribution  contaminated  at  a  fixed  point  x,  that 

is  with  probability  1-e.  sampling  is  from  the  hypothesized  parent 

distribution,  but  with  probability  e.  an  observation  is  x.  If 

-1/2  -1/2 

contamination  is  0(n  ),  i.e.  t  *  cn  for  some  c  >  0.  then 

typically  the  aayaptotlc  variance  or  covariance  matrix  of  T  is 

unaffected,  but  the  null  mean  -*  cfl  as  n  -*  ».  The  vector  (2  *  (2(x)  is 

the  influence  due  to  contamination  at  x;  its  presence  imposes  a  bias 

on  the  asymptotic  null  distribution  and  a  distortion  of  test  levels. 

To  evaluate  (2  for  the  OS  test,  the  null  mean  and  covariance 

-1/2 

matrix  of  Q(0)  under  0(n  )  contamination  are  required;  as  before 

assume  the  parent  distribution  to  be  N(0,  I^) - 

It  is  easy  to  see  that  covariance  is  unaffected  asymptotically, 
and  that  as  previously  calculated 

cov{Q(0)}  *B^I2  (12). 

To  evaluate  E{Q(0)},  use  the  form  (4);  then 

E{Q(0)>  «  |n(n-l)  E(pa). 
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where  if  is  a  typical  observed  vector.  |a|  «  |x^|  and  a  is 

perpendicular  to  Xj .  pointing  into  the  half  plane  P+.  and  if  p+,p_  are 

probabilities  of  other  observations  in  P+,P_,  then  p  *  p+-p_ .  If  x^ 

is  fro*  the  contaainant  x,  then  p  ■  0  but  if  x^  is  froa  N(0.  Ig).  then 

p  ■  6  if  x  C  P+,  but  p  s  -e  if  x  €  P_.  Averaging  over  Xj  positions. 

using  the  independence  of  orthogonal  coaponents  of  N(0,  Ig)  and  the 

1/2 

fact  that  expected  absolute  value  of  N(0.1)  is  (2/v)  yields 

E{Q(0)}  »  \  n(n-l)  e(l-e)  (2/v)1/2  ux. 

where  ux  is  a  unit  vector  in  the  direction  of  x. 

-1/2 

Now  let  e  ■  cn  and  calculate  the  aean  of  the  normal  ized 
statistic  B  '^(O).  i.e. 

B~1/2  |n(n-l)  cn‘1/2(l-cn"1/2)  (2/v) 1/2  ux 
-1/2 

which  froa  (12)  approaches  2  vc  ux  as  n  •*  •».  Thus  for  the  06 
test,  the  Influence  function  is 

0(x)  «  2" 1/2  vux. 

In  depending  only  on  the  direction  and  not  magnitude  of  the 

contaainant  position  x,  this  is  analogous  to  influence  for  the 

-1/2 

univariate  aedian.  The  factor  2  is  attributable  to  the  normal 
parent  distribution.  In  having  bounded  Influence,  the  OS  test  has 
high  robustness. 

Another  bivariate  sign  test  analogue,  though  not  affine 
invariant,  is  the  angle  test,  corresponding  to  the  spatial  median  (see 
Brown.  1983).  Siallar  but  easier  calculations  show  the  standard 
normal  influence  function  for  angle  tests  to  be 

n(x)  -  21/2  ux. 


-  14  - 


which  is  also  bounded,  and  nedlan-analogous ,  with  a  smaller  constant 
1/2  -1/2 

2  The  larger  constant  2  w  for  the  06  test  can  be  seen  as  a 

■odest  price  to  pay  for  the  iaportant  property  of  affine  invariance. 

Other  Influence  functions  which  are  readily  calculated  are 
Q(x)  *  x  for  Hotelling’s  test  based  on  the  bivariate  saaple  mean,  and 
therefore  of  unbounded  influence  as  expected,  and  for  the 
componentwise  sign  test,  where  the  test  vector  has  univariate  sign 
tests  as  coaponents.  R(x)  *  2  u  ,  where  u^  is  a  unit  vector 
splitting  the  quadrant  containing  x.  Thus  the  latter  test,  which  is 
not  affine  invariant,  also  has  bounded  influence. 
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